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Abstract — We present a solution to evaluate the performance of 
transport protocols as a function of link layer reliability schemes 
(i.e. ARQ, FEC and Hybrid ARQ) applied to satellite physical 
layer traces. As modelling such traces is complex and may 
require approximations, the use of real traces will minimise the 
potential for erroneous performance evaluations resulting from 
imperfect models. Our Trace Manager Tool (TMT) produces the 
corresponding link layer output, which is then used within the ns- 
2 network simulator via the additionally developed ns-2 interface 
module. We first present the analytical models for the link layer 
with bursty erasure packets and for the link layer reliability 
mechanisms with bursty erasures. Then, we present details of the 
TMT tool and our validation methodology, demonstrating that 
the selected performance metrics (recovery delay and throughput 
efficiency) exhibit a good match between the theoretical results 
and those obtained with TMT. Finally, we present results showing 
the impact of different link layer reliability mechanisms on the 
performance of TCP Cubic transport layer protocol. 

I. Introduction 

The performance of satellite communications is mainly 
driven by the bit-error rate and the link delay. In order to 
improve the quality of video broadcasting or safety communi- 
cations, existing interactions between reliability mechanisms 
at the link and other layers such as the transport layer must 
be considered. Data reliability can be operated independently 
at different levels of the communication stack (i.e. at every 
layers of the OSI model) and cross-layering techniques aim to 
optimize network usage while enabling better communications 
between individual layers. Reliability mechanisms exist at 
the transport layer (e.g. TCP enables an ARQ retransmis- 
sion schemes), at the application layer (e.g. AL-FEC) or at 
lower layers such as the physical and link layers (e.g. ARQ, 
HARQ, erasure codes). In 00, the authors present different 
mechanisms (FEC, ARQ and HARQ of type II) for the data 
transmission reliability but do not analyse their performance 
over a bursty channel. Using the results of 0, the authors in 
[4 1 show that bursty errors can be modeled as a Markov chain. 

While there are many studies that assess the impact of 
burst-correction codes on the physical layer level using this 
channel model 000, little attention has been directed 
to the link layer where the channel impairments, seen as 
packet erasures, are approximated with a Bernoulli channel. 
As a result, the performance of erasure correcting codes over 
bursty erasure channel at the link layer, in terms of trade- 
off between throughput efficiency and recovery delay, has not 



been extensively studied. Especially in the context of Land- 
Mobile Satellite (LMS) channel and geostationary satellite 
systems, it is more realistic to consider bursts of losses at the 
link layer level, while it is acceptable to neglect the impact 
of queuing delays and processing times as these are close to 
negligible compared to the transmission delay. Therefore, one 
contribution of this paper is an analytical tool that enables 
evaluations of the performance of different link layer reliability 
schemes. We propose to assess the impact of these link 
layer reliability mechanisms over a bursty erasure channel 
modelled by a Markov chain. The resulting algorithms enable 
us to evaluate the performance of reliability mechanisms over 
satellite links. 

To this end, we have developed a Trace Manager Tool 
(TMT) that computes the impact of a given reliability scheme 
at the link layer level as a function of a given physical 
layer trace. We combine TMT with the network simulator 
ns-2 to study the impact of link layer reliability mechanisms 
on the transport protocols. In 0, the authors use a similar 
approach. However, they consider a simple hybrid space- 
terrestrial network and only focus on one link layer scheme 
(ARQ). The authors in also consider link layer data inside 
ns-2. Compared to our proposal, they attempt to model the 
erasures and delay introduced by the reliability schemes at 
the link layer making their implementation inflexible and their 
results applicable to only a single physical layer model. To 
the best of our knowledge, there is a clear lack of tools 
allowing the evaluation of all existing reliability schemes at 
the link layer following real physical measurements, and our 
contribution fills the gap in this area while utilising the rich 
existing source of transport protocol implementations within 
a simulator such as ns-2. 

We have organised this paper as follows: in Section [II] 
we present a bursty channel model and the method used to 
estimate erasure probabilities. We detail the algorithms used 
to evaluate the throughput efficiency and the recovery delay 
induced by these mechanisms in Section [HI] In Section IIVI 
we present the cross validation between the theoretical results 
and TMT. Section [V] presents the TMT tool and the module 
developed in ns-2 to introduce the reliability schemes at the 
link layer. This section also illustrates the performance of TMT 
based on a TCP Cubic example. Finally, Section [VI] presents 
the conclusion and future work. 



II. Link layer bursty erasure packet model 

This section explains how we use the concept of a bursty 
bit error channel (physical layer) model to derive a bursty 
erasure packet (link layer) model. We base our analysis on 
the algorithms presented in IfTUl to model link layer reliability 
schemes with a slight adaptation. In iflOl , the authors propose 
two methods to express the error probabilities of an error 
correcting code over a bursty channel. In particular, they 
provide a complete expression and computation method for 
the bit error probability (i.e. at the physical layer level). In 
our context, we need to modify these results by considering 
erasures at the link layer. 

A. 2-State Markov Chain 
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Fig. 1. Bursty errors and bursty erasure models 



A 2-State Markov chain model is commonly used to repre- 
sent a bursty error channel at the physical layer level. The good 
state probability (resp. bad state) presents an error probability 
Pq (resp. Pb) and a changing state probability a (resp. j3). 
In the good state (resp. bad state) errors occur with low (resp. 
high) probability, which illustrates the bursty aspect of the 
channel. We also use this model with corresponding erasure 
probabilities to simulate bursty erasures at the link layer level 
as illustrated in Fig.Q] In the context of satellite transmissions, 
this model if of interest as long bursts of erasures might occur. 

B. Evaluation of Erasure Probabilities 

For an erasure code, the capability to recover data pack- 
ets depends on the number of erasures over both data and 
repair packets. The erasure probability distribution during a 
transmission over a channel with memory can be analysed 
through a 2-State Markov chain model, consisting of: the 
previous state of the chain, the probability to change the 
state and the corresponding erasure probability. As this 2- state 
Markov chain model applies to every packet, the totality of 



different erasure combinations over a number of packets can 
be considered through a mathematical induction. 

We now present the iterative methods used in the following 
analysis. Let P(m, n), be the probability of having m erasures 
over n packets, Pc{m, n) (resp. Pb(to, n)) the probability 
to have m erasures over n packets and to be in the good 
state (resp. bad state) when the n th packet is received. In 
order to compute P(m, n), we drive a double mathematical 
induction over m and n, considering first the current state of 
the chain, and then the current erasure probability. We validate 
the implementation of this algorithm by comparing it's results 
with the results provided in ifTOl . 

III. Modelling link layer reliability mechanisms 

WITH BURSTY ERASURES 

We now present the expressions for the throughput effi- 
ciency and the recovery delay for specific reliability schemes 
(FEC, ARQ and HARQ of type II) at the link layer level. 
These mechanisms are presented through an example in Fig. |2] 
In the following, a full TP packet' is fragmented into Link 
Layer Data Units (denoted LLDU) before transmission over 
the physical layer. 

Queuing delays and processing times are considered in stan- 
dard link layer models. As our analytical model is designed 
specifically for satellite links (with round trip times in the 
order of 400msec or greater), the impact of these additional 
delays can be neglected in comparison to the round trip time 
delay. 
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Fig. 2. Error control mechanism 



A. FEC: Forward Error Correction 

In the FEC scheme, the sender sends a combination of data 
and repair LLDUs. Let Nd (resp. Nr) be the number of 
data (resp. repair) LLDUs and N = Nd + Nr. The process 
to recover data LLDUs is successful if at least Nu LLDUs 
are received, otherwise (if the number of erasures is strictly 



greater than Nr) no correction is possible. First, we define 
the throughput efficiency as the ratio of the received LLDUs 
and the total number of LLDUs sent: 



Vfec 



(1) 



N D + N R 

where Pr{i) represents the probability that i LLDU are 
received. Over a bursty erasure channel, this is computed 
following the previously explained mathematical induction: 
Pr{^) = P{h^D + Nr) where P(m,n) is the probability 
of having m erasure in n LLDUs. 

Second, if a LLDU is erased, the additional delay will corre- 
spond to the time needed to receive the whole IP packet (data 
and repair LLDUs) needed by the FEC scheme to evaluate 
whether this IP packet can be recovered. This recovery delay, 
d, is related to the position of the LLDU in the total IP packet. 
On the average, we can consider the erasure to be located in 
the middle of the IP packet. With this hypothesis, the recovery 
delay d for packets at the receiver can be calculated as: 

d = ^ + P (l - £ P(i,N- 1)J ^T P (2) 

where Tp is the time needed to receive a LLDU and p the 
global erasure probability. 

B. Interleaved FEC 

Interleaving is an efficient and commonly used technique to 
improve the data transmission over a bursty channel, as erasure 
bursts can be spread into a number of different codewords. It 
is possible to change the characteristics of the channel with 
<[3j in order to consider the interleaving. Let p and p be the 
local erasure probability and the correlation between the states 
(considering a simplified channel with Pq = and Pb = 1). 
We have: 
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If an interleaver with a depth / is used on this bursty 
channel, we obtain a new bursty channel with the following 
changing state probabilities ai and /?/: 

ai — V + P 7 -(l —p) and Pi — (1 - p) + p 1 .p 

The performance of interleaved FEC can be then obtained 
by applying parameters ai and j3j in equations (Q]) and ©■ 

C. ARQ: Automatic Repeat- reQuest 

Automatic Repeat-reQuest mechanism at the link layer level 
consists in the retransmission of the LLDU that have been 
lost during the transmission. The throughput efficiency (also 
called goodput which is, by definition, the application layer 
throughput) corresponds to the probability that a LLDU is 
received. In the context of high delay links, the channel 
probably changes its state before retransmissions are sent. 



Thus, we do not consider burst of erasures when using ARQ. 
Furthermore, we can neglect this notion as this scheme does 
not introduce correlation between different LLDU of the same 
IP packets. Then, the recovery delay can be expressed as 
follows: 

d ARQ = ^-+Y j p l - 1 {l-p)i.RTT 

where p is the global erasure probability. 

D. HARQ-II: Hybrid ARQ of type II 

This mechanism is a combination of the FEC and ARQ 
mechanisms and after the first transmission of a FEC block, 
including data and repair LLDUs, HARQ-II allows the sender 
to send additional repair LLDUs when a recovery is not 
possible at the receiver side. In other words, if no correction is 
possible at the receiver, the transmission of additional repair 
LLDUs is requested by the receiver. At each new transmission, 
the sender transmits more LLDUs than requested by the 
receiver: if the receiver requires n LLDUs to recover the data, 
the transmitter sends (n+Ns) LLDUs when N$ is the number 
of supplementary repair LLDUs sent. Let R,. be the probability 
that the data can be decoded after r retransmissions, Tp(r) the 
time needed to receive the LLDU of the r th retransmission, 
Np, the number of data source LLDUs, Nr the number of 
repair LLDUs, and N = Nb> + Nr. For applications with time 
constraints, a limited number of authorized retransmissions, 
denoted by R, is considered. 

1) Throughput Efficiency: The throughput efficiency for 
HARQ-II is expressed as the ratio of the received LLDUs 
and the total number of LLDUs sent: 
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where Pr{i) is the probability that i LLDU are received 
and Ps{j) the probability that j LLDU are sent. 

N D /ft-1 \ W D -1 

^ p R (i).i = lj2 R * )- Nd + £ Pr ^- 1 (4) 

i=l \z=0 / i=l 

with R = 2 (2 complementary transmissions are au- 
thorized), (|4|i can be calculated according to the following 
expression: 

P R (i,i<N D ) = P(N D -i,N D ) 

N R (N D -i)+N s -N R +h h+N s 
x X] X! Ai,Nr>,N R ,N s ,luh,l3 



with: 



Ai,N D ,N R ,N s ,h,l 2 ,l 3 =P(h,N R ) 

xP(l 2 ,(N D -i) + Ns-N R + h) 
xP(l 3 ,l 2 + N s ) 
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IV. Cross-validation and interpretation 



if (N D ~i)>N R 
{N D - ») - N R if (N D -i)< Nr 

1 if (N D — i) = Nr 

We consider every combination of erasure positions to 
determine Ps(j)- For each complementary transmission, the 
number of repair LLDU sent is linked to the current number of 
erasures. If there are n erasures at the first IP packet (data and 
repair LLDU) sent, and if the correction capacity of the code 
is Nr, there are two possibilities: if n ^ Nr, no transmission 
of repair LLDU is needed; if n > Nr the receiver requests for 
n — Nr + Ns repair LLDU. The expressions (0 are given with 
R = 2 and with P(m, n), the probability to have m erasures 
over n LLDU. 

Ps(j)=J2S N .P(l ,N) 

lo=0 
N N s 

lo=N R + l h=Q 
N lo+Ns 

+ Yl Yl $io,h,N-P(lo,N).P(h,lo-NR + N s ) 

lo=N R +l h=N s +l 

(5) 

with: 
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lif j = N + {l - N R + N S ) 
Oif j^N+(l -N R + N s ) 
f 1 if j = N + (lo - Nr + N s ) + h 

1 if j ? N + {l - N R + N s ) + h 



2) Recovery Delay: We have to consider both the time 
needed to receive the first FEC block (data and repair LLDU), 
Tr(Q), and the additional repair LLDU. This recovery delay, 
denoted (Iharq can be expressed as follows: 



dHARQ = Tr(0) 



with RTT > T R (i 
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E. Conclusion 

The theoretical expressions developed in this section will 
be used: 

• to validate the TMT tool presented in the following 
section; 

• to quickly estimate the performance of the link layer 
reliability schemes. 



In this section, we present the TMT tool and then measure 
the resulting throughput efficiency and recovery delay over 
link layer output. We then cross-validate the TMT tool results 
with the theoretical metrics previously in Section [HI] 

A. The Trace Manager Tool: TMT 

We present the Trace Manager Tool that implements the 
standard link layer reliability schemes (i.e. ARQ, FEC, H- 
ARQ). 

The input data of TMT consists in a list of parameters 
(reliability scheme used, Nd, Nr, RTT, etc.) and the physical 
layer trace considered. We propose two ways to use the 
physical layer input, depending on the origin of the erasures: 

> direct use of the physical layer trace: the physical traces 
are measured and erasure events occur at the link layer 
according to real channel evolutions and to the most 
recent codes at the physcial layer; 

« indirect use of the physical layer trace: erasures are 
introduced on one error-free input trace following a 2- 
State Markov chain model as explained in Section [EI] 

In the use case presented, the physical trace corresponds to 
a satellite data transmission with a duration of of 500 seconds 
and has been provided by courtesy of CNEsQ- As the physical 
trace provided is error-free, we thus introduce bursty eras ures 
over this physical layer trace following the Markov chain 
model presented in Section [TT] TMT computes the equivalent 
output link layer trace according to the input trace and the 
chosen reliability scheme. We only keep useful data LLDU 
(and not repair LLDU or retransmission) and adapt their 
decoding delay according to the reliability scheme chosen. 
Then, we compute the throughput efficiency (i.e. goodput) and 
the recovery delay. 

The basic principle of TMT is to map available LLDU at the 
link layer to incoming IP packets and to schedule the emission 
of the packets at the transport layer level conjointly with the 
reliability schemes introduced at the link layer level. Later in 
Section |VJ we illustrate how TMT allows to assess the impact 
of link layer reliability schemes on transport layer protocols 
performance. 

B. Validation 

For each state, we compute the theoretical metrics through 
the equations detailed in Section [TIT] and the resulting met- 
rics obtained with TMT. We present in Fig. |3] the results 
obtained on a given set of parameters. The chosen parameters 
are: RTT = 500ms, N D - FEC = 10, Nr-fec = 12, 
Njj—harq = 5, Nr- HA r Q = 7, a = 0.99, G [0.1; 0.98], 
which induced a global erasure probability p £ [0.01; 0.3] and 
a length of erasure bursts £{, £ [1;50]. Both figures confirm 
that the theoretical expressions developed fits TMT results. 
Note that we only present a subset of our experiments and that 
several other set of parameters have been tested with success. 

'CNES is a government agency responsible for shaping and implementing 
France's space policy in Europe, see http://www.cnes.fr/ 
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Fig. 3. Validation of the throughput efficiency and the recovery delay 



C. Interpretation 

We propose to exploit the theoretical expressions given in 
Section|III]to compare the three recovery mechanisms in terms 
of recovery delay and throughput efficiency over a bursty 
channel. For the simulation, we use the following parameters: 
RTT = 500ms, N D = 38, N R = 13, R = 2, a = 0.98, 
(3 G [0.1; 0.93], which induces a global erasure probability 
p G [0.01; 0.3] and a length of erasure bursts % G [1; 14], 
Please note that the interpretation of the following results is 
limited to the given parameters. 

The results presented in Fig. [4] have been obtained with 
MATLAB. We note that ARQ and HARQ can transmit sup- 
plementary LLDUs if the IP packet cannot be rebuilt. In 
the context of satellite links, the delay resulting from the 
retransmissions impacts the data delivery and although these 
retransmissions enable the recovery of lost LLDUs at a later 
time, they may not be gainfully utilized by the time con- 
strained applications. When erasure occurrence is low, ARQ 
demonstrates better performance than HARQ as the transmitter 
does not send useless repair LLDU. Therefore, when erasure 
occurrence is higher, HARQ introduces less delay thanks to 
the initial repair LLDUs. Although the transmission can be 
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Fig. 4. throughput efficiency and recovery delay: use case 



reliable with both ARQ and HARQ schemes, the introduced 
delay needs to be considered in the design of networks with 
time constraints. The theoretical models presented in this paper 
allow a fast analysis of the performance of reliability schemes 
over various channels and can consequently assist the network 
designer with the choice of the most appropriate scheme to be 
used. 

V. Practical used of TMT with ns-2 

We now present how to play TMT resulting output trace 
with the network simulator ns-2. We first generate link layer 
trace with TMT following the studied physical layer trace. The 
ns-2 simulator allows to drive simulation based on external 
traces. As a result, ns-2 loads the link layer trace and the 
standard ns-2 queuing mechanism uses the arrival and de- 
coding times of each link layer data unit to evaluate the IP 
packet sending time. In this simulation, we consider a simple 
link between a satellite and a gateway. The transport layer 
uses TCP Cubic protocol. We aim to observe the performance 
of this transport protocol in terms of number of packets sent 
and retransmissions (at both link and transport layers) over 
different link layer reliability schemes. 

We use the same error-free physical trace previously intro- 
duced in Section [TV] The objective of the study is to estimate 
the impact of reliability mechanisms when different losses 
distribution are introduced over this channel. We thus intro- 
duce bursty erasures over the physical layer trace following 



the Markov chain model presented in Section (TTJ the mean 
burst length corresponds to 12.5 link layer data units and the 
mean erasure probability is 20% (a = 0.98 and j3 = 0.92). 
We propose to focus on the ARQ and HARQ mechanisms 

(Nd—harq = 10, Nr—harq = 20). 
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Fig. 5. Comparaison between ARQ and HARQ at the link layer 
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TABLE I 

Comparison of the probability of having a specific number of 
(re)transmissions for arq and harq 



We sum up a sample of the results obtained in Fig[3] and 
Table]]] Please note that we do not set the parameters to 
optimize the transmission. The objective is only to illustrate 
the capability of TMT. This example shows that with the 
same transport protocol, HARQ of type II at the link layer 
sends more packets than ARQ. As shown in Fig|5] we observe 
that with ARQ scheme (resp. HARQ of type II), the receiver 
acknowledged 2000 IP packets (resp. 4100 IP packets). As the 
number of retransmissions at the link layer is limited, the zoom 
square in Figj5] shows that an IP packet is dropped and then 
retransmitted. For both reliability schemes, packets which are 



not useful are sent (redundancy packets for HARQ, and lost 
packets for both schemes) but HARQ introduces less delay 
(i.e. less retransmissions at the link layer level). Through this 
example, we illustrate that in case of noisy channels, HARQ- 
II outperforms ARQ. Moreover, as fewer packets are dropped 
at the transport layer, the retransmissions at this layer level 
are mainly spurious: we can perform further works with TMT 
with the aim to reduce the number of spurious retransmissions 
and unnecessary congestion window decreases in the context 
of satellite links. 

We note that the results obtained with TMT directly assist 
with understanding the impact of reliability schemes at the 
link layer on the performance of upper layers. To the best of 
our knowledge, TMT is the first tool enabling such studies 
based on real physical layer traces. 

VI. Conclusion 

In this article, we present a Trace Manager Tool (TMT) that 
computes the equivalent link layer output of a real physical 
trace as a function of the reliability schemes used (FEC, 
ARQ, HARQ). We propose a module for ns-2 that takes into 
account this link layer trace in order to study the impact of 
link layer reliability schemes on the performance of transport 
protocols. We provide theoretical expressions for the through- 
put efficiency and the recovery delay for these reliability 
schemes over bursty channels, enabling us to validate our 
TMT tool and to drive fast evaluations of their performance. 
The resulting model allows better assessment of the benefits 
brought out by the reliability mechanisms in terms of QoS for 
the applications. 

In future work, we plan to further utilise TMT and the ns- 
2 module in order to drive an extensive study of the impact 
of link layer reliability schemes on transport protocols perfor- 
mance, in the context of hybrid space-terrestrial networks. 
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